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TEMPLATE-DIRECTED INTERFERENCE FOOTPRINTING 
OF PROTEIN-ADENINE CONTACTS 

5 Field of the Invention 

The present invention relates generally to the study of protein-DNA interactions, and 
more particularly to a technique for inhibiting protein-adeninc noncovalent contacts without 
disrupting the secondary structure of DNA, thereby providing a template-directed interference 
technique for precisely studying protein-DNA interactions. 

to 

Background of the Invention 

The study of molecular recognition at the interface between protein and DN A is of 
paramount importance in the study of protein regulation of gene expression. Such protein-DNA 
recognition involves noncovalent interaction (hydrogen bonding, ionic interactions, and nonpolar 

15 interactions) between nitrogen bases of DNA and amino acids of protein characterized by great 
stability and specificity. 

A variety of techniques, referred to as DNA footprinting, have been developed lo study 
these interactions. These techniques typically involve chemical alteration of DNA to inhibit 
interaction with protein, exposure of the DNA lo protein, separation of protein-bound DNA from 

20 unbound DNA, and subsequent DNA cleavage and sequencing, or exposure of DNA to protein 
followed by addition of a chemical moiety that will interact with DNA except where protected by 
protein, followed by cleavage and sequencing. The results of a polyacrylamide sequencing gel 
can identify the location of protein binding in these procedures via identification of a lack of 
DNA fragments in a molecular weight region that would have been present without protein 

25 binding at a specific location, or presence of bands signifying DNA fragments that arc present 
because DNA was not allowed to bind at a specific location. 

Specifically, in "DNA protection footprinting", protein is bound lo end-labeled DNA and 
the resulting complex is treated with a reagent that interacts with DNA (but not at regions bound 
to protein) to produce chemical lesions that give rise to strand cleavage. The conditions of 

30 treatment are adjusted such that each DNA molecule receives statistically one lesion, thereby 
producing a statistical mixture of singly-modified DNA molecules each including a cleavage site 
at a random position along the DNA chain, but not at the site of protein binding. Subsequent 
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cleavage and gel sequencing of the cleaved, prolein-bound DNA, and comparison with a 
sequenced control of DNA treated identically but without protein binding, results in a lack of 
bands associated with protein binding positions. 

"DNA interference footprinting" involves treatment of end-labeled DNA with a reagent 

5 that prevents protein binding and provides a cleavage site, to produce a probe of statistically 
singly-modified DNA molecules. The pool of DNA is incubated with DNA-binding protein, 
protein-bound DNA is separated from unbound DNA, and both populations are subjected to 
cleavage conditions. The resulting fragments arc separated on a sequencing gel and, by 
comparison of the bands representing bound DNA with bands representing unbound DNA, a lack 

10 of bands in the protein-bound fraction and presence of bands in the protein-unbound fraction, 
each in a region corresponding to a molecular weight distribution of fragmentation at a particular 
location on the DNA, is indicative of protein binding at that region. 

Other footprinting techniques, such as DNase 1 footprinting, Exonuclcase HI footprinting. 
hydrpxyl radical footprinting, diethyl pyrocarbonate footprinting, KMn04 and OsO^ footprinting, 

1 5 ethylation interference footprinting, uranyl photofootprinting, methylation protection and 
methylation interference footprinting, and missing contact footprinting are known and are 
described by C. J. Larson and G. L, Vcrdinc, **Thc Chemistry of Protein-DNA Interactions", 
Bioorganic Chemistry: Nucleic Acids, S.M, Hecht, Ed., Oxford University Press, New York, 
1996, pages 324-342. Most of the above techniques do not provide information as to which 

20 groove of DNA, major or minor, is involved in binding. 

An optimal footprinting method for determining specific base contacts at Proteiri-DN A 
binding sites should possess a number of characteristics, including the following: (1) the method 
should be capable of assaying contacts to all four bases in both the major groove and the minor 
groove of DNA, (2) the structure of the interfering probe moiety should be known, (3) the 

25 interference probe should minimally perturb DNA secondary structure relative to its natural 
counterpart, and (4) the method should be operationally simple. Unfortunately, many of the 
footprinting methods described above fall short of these goals since they involve attachment of a 
chemical moiety to DNA thai can affect larger-scale phenomena such as DNA conformation. 
Accordingly, template-directed interference (TDI) footprinting (described also in the 

30 above-reference article of Larson and Verdine) has been developed to circumvent the chemical 
reagent-based approach. TDI footprinting relies upon the ability to incorporate, in the DNA 
polymer itself, a molecule that is similar enough to one of the nucleic acids to avoid disruption of 
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DNA secpndarj' structure, that possesses the ability to base-pair as would a normal nucleotide, 
but that disrupts protein-DNA interaction, and is cleavable. TDl footprinting offers significant 
advantage over others of the above-described footprinting methods in that aheration of DNA 
secondary structure is avoided, and DNA and/or protein is not induced to act in a manner 
5 inconsistent with natural protein/DNA bonding. 

Although TDI footprinting offers significant advantage in the study of protein-DNA 
interaction, availability of the necessary nitrogen base analogs for the technique is limited 
because of the challenging requirements discussed above. TDI footprinting analogs of guanine, 
cytosine, and thymine have been reported (Hayashibara, K. C, Verdine, L., J. Am, Chem, 

10 Soc, 1991, 113, 5104-5106; Hayashibara, K. C, Verdine, G. L., Biochemistry, 1992, 31, 1 1265- 
11273; Mascarenas, J. L., Hayashibara, K. C, Verdine, G. L., ./ Am. Chem. Soc. 1993, 115, 
373-374, respectively), However, although TDI footprinting has been known since at least 1991, 
and cited extensively, TDI footprinting of the DNA base adenine has remained elusive. 
Significantly, without an available, suitable TDI footprinting analog of adenine. TDI footprinting 

15 as a technique has been constrained in that it could not be used to analyze contacts to the base 
surface of the entire major groove of DNA, the principal locus of sequence-specific interactions 
in protein-DNA complexes. 

It is, therefore, an object of the present invention to provide TDI footprinting involving 
adenine. 

20 

Summary of the invention 

The present invention provides a molecule that mimics adenine, in that it can be 
incorporated into DNA without adversely affecting normal base pairing to thymine, but that 
disrupts adeninc-dcpendant DNA binding to proteins. The molecule find particular utility in a 
25 method also provided by the invention that involves study of protein-DNA interaction* 
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The molecule of the invention has a formula: 



5 




z 



(1) 



where X is a moiety allowing cleavage of double bond 12 under conditions tolerable by a 
protein-DNA complex, R is H or a hydrocarbon, and Z is an oligonucleotide, 2'-deoxyribose-5- 

10 'triphosphate, or a functionality attachable to an oligonucleotide (which can be selected by one of 
ordinary skill in the art). According to another aspect of the invention, an oligonucleotide, such 
as DNA, including at least one nitrogen base of the above formula is provided. 

A method of the invention involves a technique for determining contncls in a prolcin- 
DNA complex. A plurality of cnd-labcled DNA strands are provided, each including, 

15 statistically, a nitrogen base of the above formula. A DNA-binding protein is added to the 

plurality of strands, protein-bound DNA is separated from unbound DNA, and the DNA strands 
are cleaved. The strands then arc sequenced to determine a molecular weight region populated 
predominately by the unbound strands, representative of cleavage sites indicating a protein- 
binding site. 

20 The invention also provides a method of determining contacts in a protein-DNA complex, 

involving investigation with TDI footprinting analogs of all four DNA bases. The technique 
involves providing four groups of DNA. The first three groups include, respectively, a plurality 
of end-labeled DNA strands that each have, statistically, a TDI footprinting base analog of 
guanine, cytosine, or thymine. The fourth group includes a plurality of end-labeled DNA strands 

25 that each have, statistically, a TDI footprinting base analog of adenine having a formula I, above. 
A DNA-binding protein is added to each group, protein-bound DNA is separated from unbound 
DNA, the DNA strands are cleaved, and the strands are sequenced to determine a molecular 
weight region populated predominately by the unbound strands and representative of cleavage 
sites indicating a protein binding site. Provision of, and use in TDI footprinting of, TDI 

30 footprinting base analogs of guanine, cytosine, and thymine is described in the above-noted 
articles of Hayashibara, K. C, Verdine, G. L., J. Am. Chem. Soc, 1991, 1 13, 5104-5106; 
Hayashibara, K. C, Verdine, G. L.. Biochemis/ry, 1992, 31, 1 1265-1 1273; Mascarenas, J. L., 
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Hayashibara, K. Verdine, G. L.,J, Am, Chem. Soa, 1993, 115, 373-374, respectively, all 
incorporated herein by reference. 

Other advantages, novel features, and objects of ihe invention will become apparent from 
the following detailed description of the invention when considered in conjunction with the 
accompanying drawing. 

Brief Description of the Drawinyy 

Fig. 1 is a photocopy of nondenaturing polyacrylamidc gel electrophoresis (PAGE) of 
mixtures containing a pool of the inventive adenine analog - substituted DNA and various 
concentrations of protein. 

Dctaiied Description of the Invention 

Min, C„ Gushing, T. Verdinc, G., "Template-Directed Interference Footprinting of .. 
Protcin-Adenine Contacts,", y. Am. Chem. Soc. 1996, 1 18, 61 16-6120; and Min, C. Verdine. G., 
"Immobilized Metal Affinity Chromatography of DNA," Nucleic Acids Res, 1 996, 24, 
3806-3810, both are incorporated herein by reference. 

The role of hydrogen bonding in protein-DNA interactions is very important, as hydrogen 
bonds normally make up a large fraction of the total number of contacts in a protein-DNA 
complex. Although it is generally accepted that hydrogen bonds contribute only modestly to the 
overall energetics of specific complcxation, they serve a critical role in determining the 
specijicify of protein-DNA interactions. Formation of matched sets of bonds including hydrogen 
bonds to DNA bases of the optimal binding site allows the entire protein to come into intimate 
contact with the DNA surface, permitting the formation of a large number of energetically 
favorable nonspecific contacts. 

The present invention provides an analog of the DNA base adenine, having the 
generalized formula I, as described above. When R is selected as H, either X is selected so as not 
to be amenable to hydrogen bonding (so as not to hydrogen-bond with an adjacent species) or X 
is selected so as to hydrogen bond with R so as to prevent hydrogen bonding between R and an 
adjacent species. When X is selected so as not to be amenable to hydrogen bonding, it is, for 
example, selected from moieties other than F, N, and O. When R is selected as a group not 
including a H amenable to hydrogen bonding, X can be a moiety amenable to hydrogen bonding. 
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. R and X should be selected, together, so as not to alter the conformation of species I in a 
manner that would prevent species I from participating in normal Watson-Crick bonding. That 
is, they should be selected so as not to sterically interact in a way that would cause species I to 
twist appreciably. 

5 Preferably, R is H or a hydrocarbon group (including cyclic hydrocarbon groups), 

optionally interrupted by hetero groups. As used herein, "hydrocarbon'' is meant to include 
alky], alkenyl, alkynyl, cycloalkyl, aryl, alkaryl, aralkyl, and the like. Hetero groups can include 
-0-, -CONH-, -CONHCO-, -NIK, -CSNH^, -CO-, -CS-, -S-, -SO-, -(0C1UCH2)„R (where n = l. 
10), -(CF2)^- (where n = 1-10), olefins, and the like. Most preferably, R is a small moiety such as 

10 H or CHj. R should be selected so as not to interfere with hydrogen bonding between H 1 0 and 
thymine. 

X is a moiety allowing cleavage of double bond 12 under conditions tolerable by a 
protein-DN A complex. Those of ordinary skill in the art can routinely select X according to this 
criterion. Typically, X has electron-withdrawing capacity sufficient to render double bond 12 

15 readily clcavable by, for example, aqueous base, so that the TDI footprinting technique described 
above, and described in the above-referenced article of Larson and Verdine can be carried out. In 
this case X is, preferably, an electron-withdrawing group such as -CN, -Br, -CI, -NO,, -SO3* - 
SOjR, -COOH, -F, or the like. Alternatively, X can be an electron-donating group including 
electron-donating capacity sufficient such that bond 12 can he cleaved by permanganate, 

20 followed by base. In this case, X can be -()CH3, -CH3. -OH, -Ar (aromatic), -NHCOCH^, or the 
like. 

R and X can, together, define a connected, or cyclic, moiety, as will be appreciated by 
those of skill in the art. 

According to one, particularly preferred embodiment where R is H, X is -NO2 and 
25 interacts by hydrogen bonding with R, preventing hydrogen bonding between R and protein. 
The molecule I of the invention satisfies the following criteria. First, as part of an 
oligonucleotide, such as DNA, it does not disrupt secondary structure of the oligonucleotide. 
Second, it is cleavable chemically at double bond 12 so as to find utility in TDI footprinting. 
Third, it disrupts functionality with protein, achievable when the selection of X and R is made 
30 according to the criteria above. Finally, it is amenable to normal base pairing with thymine, that 
is, atoms involved in Watson-Crick hydrogen bonding are not be altered. 
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Referring now to structure II, Watson-Crick hydrogen bonding between adenine 14 and 
thymine 15 in DNA is illustrated. Adenine 14 



includes nitrogen 16 and hydrogen 18, involved in hydrogen bonding with thymine, which 
hydrogen bonding is not disrupted in accordance with the invention. Natural adenine includes as 
well hydrogen 20 and nitrogen 22 (including representative electron cloud 24), both of which are 
amenable to hydrogen bonding with protein interacting with DNA, at least one of which is 
prevented front hydrogen bonding in accordance with the invention. 

Referring now to arrangement III, adenine 14 and thymine 15 arc shown as paired in 
DNA, and a portion 26 of the protein glutamine is illustrated including hydrogen 28 which is 
hydrogen-bonded to nitrogen 22 of adenine, and oxygen 30, hydrogen-bonded to hydrogen 20 of 
adenine (see Larson and Verdine, "The Chemistry of Protein-DN A Interactions" referenced 
above, page 329). p 




(11) 





CH3 



(III) 
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II is one of these hydrogen bonds, between hydrogen 28 and nitrogen 22, or between oxygen 30 
and hydrogen 20, that is to be disrupted in accordance with the invention. Preferably, both are 
disrupted. 

According to one preferred embodiment of the invention (with reference to fomiula I), X 
is -NO2, and R is Vl This molecule, illustrated as formula IV below, allows for hydrogen 
bonding between X and R, namely between oxygen 32 and hydrogen 34 (R), precluding any 
hydrogen bonding between X or R and a protein. 



The function and advantage of these and other embodiments of the present invention will 
be more fully understood from the examples below. The following examples are intended to 
illustrate the benefits of the present invention, but do not exemplify the full scope of the 
invention. References cited in the following example is incorporated herein by reference for ail 
purposes. 

Example 1 : Synthesis of Analog Su itable for TDI Footprinting of Mg^ior Groove Contacts to 
Adenine 

This and the following examples are directed toward the synthesis and validation of an 
adenine analog suitable for TDI footprinting. A species having the generalized formula (I), 
specifically 7-deaza-7-nitroadcnine. 

Materials: 7-Deaza-2 -deoxyadenosine (2'-deoxytubcrcidin) (Coltam, H. B.; 
Kazimicrrczuk, Z.; Geary, S.; McKeman, P. A.; Revankar, G.R.; Robins, R. L., J. Med Chcm, 
1985, 28 1452-1467) was obtained from Bochringcr Mannheim, Naturally occurring dNTPs 
were from Pharmacia (Piscataway, NJ). Taq polymerase was from Promega (Milwaukee, Wl). 
T4 polynucleotide kinase was from Gibco BRL (Gaithcrsburg, MD), and glycogen was from 



32. 3y 




(IV) 
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Boehringcr Mannheim (Indianapolis, IN). Sequcnasc 2.0 and the Scquenase 2.0 kit were from 
United States Biochemical (Cleveland, OH). [y-^^PJATP was from New England Nuclear. Full- 
length 434 repressor was expressed in Escherichia coli using XA90/pRWl 90 and purified as 
described (Anderson, J.; Ptashne, M.; Harrison, S.C. Proc, Nati Acad Sci. U.S.A. 1984, 81, 
5 1307-1311). 

Synthesis of 4-Amino-7i»(2-deoxy-3,5-diacetoxy-6-D-erv/Aro- 
pentofuranosyl)pyrrolo[2,3-d]pyridiminc (2): To a stirred suspension of 4-amino-7-(2-deoxy-6- 
D-^r>'//?rf;-pentofuranosy])poyrroior2j3-d]pyrimidine (2'-deoxytubercidin, 1) (Cottam et al., 
above) in pyridine (1 .6 mL, 0.4M) at O'^C under N2 was added acetic anhydride (0.33 mL, 3.5 

10 mmol. 5.3 cquiv). The mixture was allowed to stir overnight, reaching ambient temperature in 
that time. The solution was diluted with methanol (3 mL), stirred briefly, reduced to dryness, 
and was placed on the vacuum line for a few moments. The residue was then subjected to a 
refluxing solution of methanol (3 niL) for 12 h. The solution was again reduced to dryness, 
yielding 240 mg of a crude solid. Chromatography: flash column (20 g of silica) 2% 

15 MeOH/CHjClj to 5% MeOH/CHjCl,. The product was obtained as a white foamy solid. Yield: 
187 mg, 85% 'H NMR (300 MHZ) (CDCI3 relative to TMS) 6 2.09 (6H, s); 2.48 (IH, ddd, J-2.K 
4.7, 14.0 Hz); 2.59 - 2.69 (IH, m); 4.27-4.37 (3H, m); 5.31-5.34 (IH, m); 5.61 (2H, br ) 
exchanged); 6.41 (IH, d, J=1.0, 3.7 Hz); 6.68 (IH, dd, J=5.7, 8.7 Hz); 7.1 1 (IH, 3, J=3.7 Hz); 
8.29 (IH, s). ''C NMR (126 MHZ) (CDCl3):6 20.76, 20.87, 37.44, 64.02, 74.63, 81.54, 83.49, 

20 99.44, 103.89, 120.89, 140.6?; 151.98, 156.88, 170.38. IR (thin film, CDCU). 3152, 1740, 1734, 
1684, 1653, 1233 cm-'. MS (El): m/e (rel intens) 334 (M\ 0.1); 275 (0.5); 134 (100); 107(0.15). 
El HUMS (Cr)(M+l): 335.1355 (C,5H,«NA requires 334.3328). 

Synthesis of 4-Amino-5-nitro-7-(2-deoxy-3,5-diacetoxy-p-D-er)///2ro- 
pentofuranosyl)pyrrolo[2,3-d]pyridiminc (3): To a vigorously stirred 0°C solution of 2 (97.9 

25 mg, 0.29 mmol, 1 .0 cquiv) in CH2CI2 ( 1 .9 mL, 0. 1 5 M) was added a mixture of concentrated 
H2SO4 (320 mg) and fuming HNO3 (260 mg) in a dropwise fashion. After 20 min the ice bath 
was removed and the biphasic solution stirred for an additional 20 min and then neutralized with 
a saturated NaHCOj solution until the pH reached 7.0. The mixture was extracted with CH2CI2 
(-200 mL), and the bright yellow solution was washed with brine, dried over MgS04, and 

30 reduced to dryness. Crude yield: 6.33 mg, 60%. The resulting dark yellow solid was used 

without further purification, 'H NMR (300 mhz) (CDCI3): 62.09 (3H, s); 2.14 (3H, s); 2.49-2.58 
(IH, m); 2.68 (IH, ddd, J=3.1, 6.0, 17^2 Hz); 4.34 (3H, br s): 5.31 (IH, m); 6.40 (lH.br s, D.O 
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exchanged); 6.62 (IH, t, J-6.4 Hz); 7.56 (IH hrs 0,0 exchanged); 8.26 (IH, s); 8.276 (IH, s). 
^^CNMR (126 MHZ) {CDCI3): 620.69, 20.79, 38.82, 63.53, 73.83, 82.62, 84.62, 95.74, 95.78, 
125.15, 129.55, 149.56, 154.12, 156.41, 170.23, 170.30 IR (thin film, CDCl3):3I50, 1740, 1734, 
1684, 1653, 1570, 1522, 1506, 1234 cm '. MS (CI): ni/e (rel intens) 380 (M= 1, 0.2); 294 (0.60); 
5 236(100). 

Synthesis of 4-Amino-5-nitro-7-(2-deoxy-P-D-erythro-pentofuranosyl)pyrrolo[2,3- 
djpyridiminc (4): To a stirred solution of 3 (1 7.4 mg„ 0.046 mmol, 1 .0 equiv) under at 0°C 
in methanol (1.3 mL) was added H2CO3 (13 mg, 0.10 nimol, 2.2 equiv. After 20 min the reaction 
mixture was diluted with brine (25 mL) and extracted with ethyl acetate (100 mL). The aqueous 

10 layer was then re-extracted with ethyl acetate (3 x 50 mL), and the organic fractions were 

combined, dried over MgSOj, and evaporated to yield a bright yellow solid. Chromatography: 
PTLC 5% MCOH/CH3-CI2. Yield: 8.8 mg, 65% 'H NMR (300 MHZ) (melhanol-d,): 62.42-2.48 
(IR m): 2.53-260 (IH, m); 3.73-3.87 (2H, m); 4.03 (1 H, t, J=2.9 Hz); 4.53 (1 1 1, dd. J=l 8, 2.9 
. Hz); 4.86 (2H, s, 0,0 exchanged); 4.87 (2H, 5,0,0 exchanged); 6.57 ( 1 H, dd, J=4.4, 6.2 Hz); 

15 8.17(lH,d,J=2.5Hz);8.70(lH,d,J-2.5H2). ^^C NMR (126 MHz) (methanol-d^): 6 41.73, 
62.75. 72.14, 86,87, 89.22, 96.88, 128.70, 130.12, 150.12, 154.75, 157.96, lR(thin film, 
MeOH): 34450335 br. 1684, 1653, 1570, 1522, 1507, 1269 cm ', MS (El): m/c (rcl intens) 295 
(M\ 0.1); 200 (0.25); 179 (100). El HRMS: m/e 295.0904 (C,,H,:,N50, requires 295.0917). Mp: 
1 86-1 90 ^C. 

20 Synthesis of 7-deaza-7-nitro-2 -deoxyadenosine 5'-Triphosphate (dA*TP,5): To a stirred 

solution of 4 (4.1 mg, 14 yLimol, 1 .0 equiv) at 0°C under in dry trimethyl phosphate (50 ^L) 
was added POClj (1.7 ^^L, 18 ^umol), 1.3 equiv). After 15 h a solution of tris- 
(tributylammonium) pyrophosphate (0.14 mL, IM, in DMF) was added to the reaction flask, and 
stirred overnight (0°C to room temperature). At this time Ei^^N (43 ^L) and H2^'d;> (612 fxL) 

25 were added to the flask and stirred for 5 min. The mixture was lyophilizcd and the residue 

dissolved in 100 yuL of H20-d2 and passed through an ion exchange column (Sephadex A25, 1 g) 
equilibrated in 0.1 M aqueous triethyl-ammonium bicarbonate (TEAB). The column was 
washed with 50 mL of H^O-dz and then elutcd with 950 mL gradient from 0 to 2M 
triethylammonium bicarbonate, with 10 mL fractions being collected. Fractions 1 1-27 were 

30 combined and lyophilized to dryness, and the remaining white powder was redissolved in 1 mL 
of TE buffer: 10 mM Tris-HCl (pH 8.0), 1 mM EDTA, This solution was used directly in 
extension reactions with Sequcnases 2.0. The yield was estimated to be 5.7 ^tmol (41%) by 
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spectrophotometric analysis using the reported extinction coefficient = 365 nm, 6=3900) 
for the related compound 5-nilrotubercidin^ ^'^P NMR (202 MHz) (0,0 referenced to 
phosphoric acid): 5-S.4(d); -6.1(m); -19.5(m). UVCH^O): X^,, - 356, 364 nm. 

Example 2: Confirmation of the Ability of the Species of Example 1 to Promote Rir^ g 
Fragmentation and DNA Strand Scission 

7-deaza-7-nitro-2 -deoxyadeninc (dA*) was synthesized by nitration of 2- 
■ deoxytubercidin. A sample of dA* was dissolved in IM aqueous pipcridine at 90 °C, and 
aliquots were periodically removed and spotted onto a TLC plate. Analysis of the resulting TLC 
plate revealed virtually complete disappearance of the dA* chromophorc within 30 miti. These 
data indicated that the base moiety of dA* undergoes hydrolytic degradation under the conditions 
used in piperdine-cataiyzcd cleavage of DNA, thus it is possible to cleave DNA selectively at 
positions containing the analog. 

Example 3: Incorporation of 7-deaza-7^mtroadenine rdA*^ into DNA 

Preparation of Single-Stranded Template DNA by PCR/IMAC (PCR/metal affinity 
chromatography): Complementary oligonucleotides containing the operator and flanking 
sequences used in the co'stallographic study of the 434 repressor (N-terminal domain)/ORl 
complex were synthesized with 3 -Pstl-compatible ends, ligated into the PstI site in the 
poly linker of the phagemid pBS+ (Stratagenc), and transformed into £. coli strain XL 1 -blue 
(Stratagene). Insert-positive recominants (designated pBS-434) were verified by sequencing of 
single-stranded phagemid DNA isolated from helper phage VCS-Ml 3 (Stratagene). 
Hayashibara, K,C. PhD. Thesis, Harvard University, Cambridge, MA 1993. 

The polymerase chain reaction followed by immobilized metal affinity chromatography 
(PCR/IMAC) was used to generate single-stranded DNA templates containing the operator 
sequence. This technique is reported in a co-pending, commonly-owned United States 
provisional patent application of Verdine, et al., filed July 3. 1996 (Serial No. 60/021,195) and in 
Min, C, Verdine, G., "Inunobilized Metal Affinity Chromatography of DNA," Nucleic Acids 
Res, 1996, 24, 3806-3810, both incorporated herein by reference. In the technique, a moiety that 
can coordinate a metal ion is inserted into DNA. Specifically, DNA sequencing primers 
containing six successive 6-histaminylpurine residues added onto the 5'-end of the DNA was 
synthesized. The PCR/IMAC technique is preferred, although other techniques can be used, 



wo 98/00433 ,^ PCT/US97/11684 

-12- 

such as an asymmetric PCR technique. Alternate techniques are discussed in the Min, et al., 
paper, Nucleic Acids Res., 1996, 24, 3806, referenced above. 

The 6-histaminylpurine moiety was introduced through the convertible nucleotide 
approach by analogy to reported procedures (Ferentz, A.E.; Vcrdine, CX., 1 Am. Chem. Soc. 
5 1991, 113, 4000-4002; Ferentz, A,E.; Verdine, G.L. In Nucleic Acids and Molecular Biology; 
Eckstein, F., Lilley, D.MJ. Eds.; Springer-Verlag: Berlin, 1994; Vol. 8, pp. 14-40. Briefly, 
resin-bound oligonucleotides containing 0*-phenyU2'-deoxyinosine (4)dl) in place of the dll 
residue were deprotected from the resin by mild ammonia treatment (cone NH^OH, nt, 4b) 
lyophilized to dryness, then treated with 5 M aqueous histamine (55 °C, 14 h) to convert the 4)dl 

10 residues to dH. The crude oligonucleotides were purified by denaturing polyacrylamide gel 
electrophoresis. An additional dG residue was added at the 5'-end to ensure efficient end- 
labeling using polynucleotide kinase. 

The l lfi-tag confers, upon an oligonucleotide, the ability to be selectively and revcrsibly 
retained on a surface, specifically Ni-"-NTA-agarose. The result is a single strand of DN A 

15 immobilized at a surface. 

Following removal of the supernatant, which contains unbound DNA, the resin was 
washed and the bound DNA ciutcd with 200 mM imidazole. 

Amplification was carried out in a 100 PCR reaction mixture containing 50 mM KCl, 
10 mM Tris-HCl (pH 8.4), 1.5 mM MgClj, 0.25 mM (each) dATP, dCTP, dCTP, and TTP, 1 

20 unit of Taq polymerase, 20 Mg of pBS-434 as template, 50 pmol of llfc-tagged Ml 3 (-48) primer 
f5'-d(GHHHHHHAGCGGATAACAATTTCACACAGG)^3'l, H = 6-histaminylpurinc, and 50 
pmol of Ml 3 (-47) primer [5'-dCGCCAGGGTTTTCCCAGTCACGA)-3']. The reaction 
solution was overlaid with a 60 ixL mineral oil cap to prevent evaporation and placed in a 
thermal cycler (MJ Research) programmed for 35 cycles set at 94°C (1 min), 55°C (I min) and 

25 72°C(lmin). 

The duplex PCR products were resolved by IMAC as follows: 1 mL of TE buffer was 
added to the crude reaction mixture, which was then concentrated to a total volume of 50 a^L 
using a centrifugal dialysis cartride (Centricon 30, Amicon). The DNA solution was transferred 
to an Eppendorf tube, to which was added 1 50 juL of binding buffer (6 M guanidine-HCl, 1 0 mM 
30 Tris, pH 8.2). This mixture was heated for 5 min at 90**C. Separately, to a 1 .5 mL Eppendorf 
tube was added 250 a^L (bed volume) of NP*-NTA agarose resin and 1 mL of binding buffer at 
room temperature. The hot DNA solution was added to the suspension of resin, and the resulting 
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mixture was mixed for 1-1.5 min by vigorous shaking. The mixture was transferred to an empty 
5 mL fitted column (Qiagen), and the flow-through fraction, which contained the unmodified 
strand, was collected into an Eppendorf tube. After repeated pipetting to ensure complete 
mixing, the unbound fractions were aliquotted into four Eppendorf tubes (300 each) and held 
5 aside for further processing. Next, the resin was washed with 1 mL of washing buffer (10 mM 
Tris, 5 mM imidazole, pH 8.0), which was discarded. The H^-containing strand was then eluted 
in 1 .2 mL of 200 mM aqueous imidazole solution. The imidazole eluate was mixed thoroughly 
and aliquotted into four Eppendorf tubes (300 /uL each), to each of which were added 30 juL of 
100 mM ethanolic 1,10-phenanihroline and 30 /^L of 3 M aqueous NaOAc. To each of the 

10 unbound and bound fractions was added 900 ^L of absolute ethanol (stored at -20''C), and then 
the lubes were vortexed briefly and chilled for 30 min of powdered solid CO7. The tubes were 
microcentrigued for 30 min at ISOOOg. The supernatant was removed and the pellet washed with 
200 mL of 80% aqueous EtOH (-20^*0. Following removal of the ethanol solution, the tubes 
were evaporated to dryness by centrifugal lyophilization (Speed Vac, Savant). To each dry tube 

15 was added 50 /zL of TE buffer, and the DNA concentration was quantified by UV 

spectrophotometry. This procedure typically yields 5-10 pmol of the H^-tagged template and the 
unmodified template (the primers are used in -5-10 fold excess in the PGR reaction). 

Template-Directed Extension: The procedure provides a sufficient template for 20 lanes 
on a DNA sequencing gel. 

20 (a) End Labeling of Primer: A 20 ^L volume of 0.2 pmol//^L M13 (-21) primer, 5*- 

d(GTAAAACGACGGCCAGT)-3\ was end labeled in a 100 f^L volume by adding 10 ^L of 400 
mM Tris-HCl (pH 7), 200 mM MgClj, 20 units of T4 polynucleotide kinase, and 4 fiL of 6000 
Ci/mmol [y-^'P]ATP. This mixture was incubated at 37°C for 30 min, heated at 90°C for 3 min. 
chilled on ice, and briefly microcentrifuged at 14000 rpm. 

25 (b) Annealing: A 100 fiL volume of the DNA template strand (approximately 0.04 

pmol//uL in TE) was added to the end-labeled primer (100 /xL) and heated to 65 ""C for 5 min. and 
then allowed to reach room temperature while remaining in the bath. The condensate was 
recombined with the annealed primer/template solution by brief microcentrifugaticn. 

(c) Primer Extension: A 200 sample /A. of annealed primer/template (4 pniol) was 

30 combined with 70 of polymerization buffer [48 mM Tris-HCl (pH 7), 25 mM MgCI,, 200 
mM NaCl, 9mM DTT, and 400 fxg/fiL BSA). The resulting solution was added to 200 fxL of a 
mixture containing 80 f^M each of dCTP, dGTP, and dTTP; and 80 ;/M (total nucleotide) 1 :3 
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mixture of dA*TP and dATP; and 50 /xM NaCL. The extension was allowed to proceed at 37 ^^C 
for 20 min and then stopped by addition of 200 /xL of 3 M sodium acciate (pH 7.0), and 
precipitated at -70 °C with 20 of 20 mg/mL glycogen and 1800 of 100% ethanol. The 
pellet obtained after microcentrifugation at 14000 rpm for 30 min at 4°C was washed with 80% 
5 aqueous ethanol and dried by lyophilization. The resulting DNA was dissolved in 80 jxL of TE, 
4 of which was then used for each subsequent incubation with protein. 

Example 4: Binding Studies InvolvinL^ Adenine Analop of the Invention Incorporated into nN>\ 
1 mM CaCl2, 100/ig/mL BSA, and 10% (v/v) glycerol: Protein-bound and unbound 

10 DNA were separated using a 5% low-ionic-slrength nondenaturing polyacrylamide gel. 

(Chodish, L.A. In Current Protocols in Molecular Biologyr Ausabel, F.M., Brent, R. Kingston, 
R,E-, Moore, D.D., Smith, J. A. Seidman, J.G., Struhl, K., Eds; John Wiley and sons: New York, 
1989; Vol. 2, pp 12.2.1 - 12.2.10). (37:1 monomer;mcthylenebisacryl-amidc). The gel was 
exposed to BioMAX film (Kodak) for 5 h. Bands were visualized by autoradiography and 

15 excised from the gel using a razor blade, llie gel slices were elcctroclulcd into 1 30 /zL of 1 OM 
ammonium acetate using a V-channel apparatus (Harvard BioLabs) with 0.5x TBE [45 mM Tris- 
borate (pH 8.0), 0.5 mM EDTAJ as running buffer. The DNA samples in 10 M ammonium 
acetate were precipitated directly by the addition of 3 Volumes of 100% ethanol followed by 
precipitation as described above. 

20 Cleavage and Sequencing Procedure. Each ^-P-labeled dsDNA sample was redissolvcd in 

50 /iL of 1 M aqueous piperidine. This solution was heated at 90 ""C for 30 min, cooled to room 
temperature, and microcentrifugcd briefly. Samples were lyophilized, twice resuspended in 
30/^L of H^O-d^ and lyophilized and then redissolvcd in 3 fxL of TE and 2 of loading dye 
containing 95% formamine, 10 mM EDTA, 0.025% bromophcnol blue, and 0.025% xylene 

25 cyanoi. Samples were heated at 90°C for 2 min and then loaded immediately onto a 7% 

polyacrylamide sequencing gel. The gel was visualized by autoradiography. The generation of 
an authentic sequence by Sanger sequencing made use of the Sequenase 2.0 kit (United States 
Biochemicals) according to the manufacturer's protocol. A volume of 2.5 fxL of annealed 
['^Pjprimcr;tcmplate mixture was used in each didcoxy termination reaction. 

30 High resolution electrophoretic analysis of the products arising from extension and 

cleavage resulted in a pattern of bands of the A* lanes matching precisely that in the Sanger 
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dideoxy-A sequencing lane, thus indicating that the analog shows the same base-pairing 
preference as its natural counterpart. 

The titration series was fractionated by nondenaturing polyacrylamide gel electrophresis 
(PAGE) , which separates naked DNA from protein-bound DNA on the basis of differences in 
5 size (Fig. 1). The bound and unbound DNA Bands were extracted from the native PAGE geK 
cleaved with piperidine, and analyzed on a high-resolution denaturing polyacrylamide gel. 
Interference was evident in a TDl-A footprint as positions in the sequence at which a band was 
weak or absent from the bound lanes and enriched in the unbound lanes, reltivc to the A* control 
Such behavior was clearly observed at one and only one position in the sequence: the 5'-A 

10 residue of 0^1 . A band at A 1 wais virtually absent from the bound lanes at all protein 

concentrations, thus indicating that binding by 434 repressor was strongly and adversely affected 
by the analog at this site. Conversely, the relative intensity of the Al band in the unbound lanes 
. increased as the protein concentration was increased, again suggesting that DNA molecules 
having the analog at position -1 bind 434 repressor poorly. 

15 Those skilled in the art will readily appreciate that ail parameters listed herein are meant 

to be exemplary and actual parameters will depend upon the specific application for which the 
methods and apparatus of the present invention are being used. It is, therefore, to be understood 
that the foregoing embodiments are presented by way of example only and that, within the scope 
of the appended claims and equivalents thereto, the invention may be practiced otherwise than as 

20 specifically described: 

What is claimed is: 
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CLAIMS 

1 . A species of a formula: 




where X is a moiety allowing cleavage of double bond 12 under conditions tolerable by a 
prolein-DNA complex, R is H or a hydrocarbon, and Z is an oligonucleotide, 2'-deoxyribosc- 
5'triphosphatc, or a functionality attachable to an oligonucleotide. 

2. A species which is an oligonucleotide including at least one nitrogen base having 
a formula: 




Z 



where X is a moiety allowing cleavage of double bond 12 under conditions tolerable by a 
protein-DNA complex, R is H or a hydrocarbon, and Z is the oligonucleotide. 



3. A species as in either of claims 1 or 2, wherein R is H and X is selected so as not 
to be amenable to hydrogen bonding, and X and R are selected so as not to sterically interact in a 
way that would cause the species to twist appreciably and thereby to prevent the species from 
participating in normal Watson-Crick bonding. 
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4. A species as in either of claims 1 or 2, wherein R is H, and X is selected so as to 
hydrogen bond with R, and X and R arc selected so as not to sterically interact in a way that 
would cause the species to twist appreciably and thereby to prevent the species from participating 
in normal Watson-Crick bonding. 

5. A species as in either of claims 1 or 2, wherein R is a group not including a H 
amenable to hydrogen bonding, 

6. A species as in any of claims 1 -4, wherein R is H or a hydrocarbon group. 

7. A species as in claim 6, wherein R is 1 1 or CH3. 

8. A species as in any precccding claim, wherein X has electron-withdrawing 
capacity sufficient to render the double bond adjacent the carbon of the ring to which it is 
coupled clevable by aqueous base. 

9. A species as in claim 8, wherein X is -CN, -Br, -CI, -NO2, -SO3" -SOjR. -COOH, 

or-F. 

10. A species as in any of claims 1-8, wherein X is -OCH3, -CHj, -OH, an aromatic 
group, or -NHCOCH3. 

1 1 - A species as in any preceeding claim, wherein R is H and X is -NO2. 

12. A species as in any preceeding claim, wherein Z is an oligonucleotide. 



13. A method of determining contacts in a protein-DNA complex, comprising: 
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providing a plurality of end-labeled DNA strands each including, statistically, a 
nitrogen base of a formula: 



where X is a moiety allowing cleavage of double bond 12 under conditions tolerable by a 
protein-DNA complex, R is H or a hydrocarbon, and Z is an oligonucleotide, 2'-deoxyribose- 
5'triphosphate, or a functionality attachable to an oligonucleotide; 

adding DNA-binding protein to the plurality of strands; 

separating protein-bound from unbound DNA; 

'cleaving ihc DNA strands, and sequencing the DNA strands (o determine a 
molecular weight region populated predominately by the unbound strands and representative of 
cleavage sites indicating a protein binding site. 



1 4. A method as in claim 13, further comprising determining contacts in a protein- 
DNA complex involving investigation with analogs of all four DNA bases, comprising: 

providing a first group of DNA including a plurality of end-labeled DNA strands that 
each have, statistically, a TDl footprinting base analog of guanine; 

providing a second group of DNA including a plurality of end-labeled DNA strands that 
each have, statistically, a TDl footprinting base analog of cytosinc; 

providing a third group of DNA including a plurality of end-labeled DNA strands that 
each have, statistically, a TDl footprinting base analog of thymine; 

providing a fourth group of DNA including a plurality of cnd-labcled DNA strands that 
each have, statistically, a TDl footprinting base analog of adenine having the formula 1; 

adding DNA-binding protein to the plurality of strands of each group; 

separating protein-bound from unbound DNA in each group; 

cleaving the DNA strands of each group; and 
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sequencing the DNA strands of each group to determine a molecular weight region 
populated predominately by the unbound strands and representative of cleavage sites indicating a 
protein binding site. 

5 

15. A method as in either of claims 1 3 or 14, wherein R is M and X is selected so as 
not to be amenable to hydrogen bonding, and X and R are selected so as not to sterically interact 
in a way that would cause the species to twist appreciably and thereby to prevent the species 
from participating in normal Watson-Crick bonding, 

10 

16. A method as in either of claims 13 or 14, wherein R is and X is selected so as 
to hydrogen bond with R, and X and R are selected so as not to sterically interact in a way that 
would cause the species to twisi appreciably and thereby to prevent the species from participating 
in normal Watson-Crick bonding. 

t5 

1 7. A method as in either of claims 13 or 1 4, wherein R is a group not including a H 
amenable to hydrogen bonding. 

18. A method as in any of claims 13-17, wherein R is H or a hydrocarbon group. 

20 

1 9. A method as in claim 1 8, wherein R is H or CH,. 

/" 

20. A method as in any of claims 13-19, wherein X has electron-withdrawing capacity 
sufficient to render its adjacent double bond clevable by aqueous base. 

25 

21 . A method as in claim 20, wherein X is -CN, -Br, -CI, -NOj, -SOi" -SO^R, - 
coon, or -F. 

22. A method as in any of claim 13-19, wherein X is -OCH3, -CII3, -Oil, an aromatic 
30 group, or -NHCOCH3. 



23. 



A method as in any of claim 13-22, wherein R is H and X is -NO2. 
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24. A method as in any of claims 13-23, wherein Z is an oligonucleotide, 2- 
deoxyribose-5'triphosphale, or a functionality attachable to an oligonucleotide. 
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